A Method for Short Message Contextualization: Experiments at CLEF/INEX
Abstract
This paper presents the approach we developed for automatic multi-document summarization applied to short message contextualization, in particular to tweet contextualization. The proposed method is based on named entity recognition, part-of-speech weighting, and sentence quality measurement. In contrast to previous research, we introduce an algorithm for smoothing from the local context. Our approach exploits the topic-comment structure of a text. Moreover, we developed a graph-based algorithm for sentence reordering. The method has been evaluated at the INEX/CLEF Tweet Contextualization track, and we provide the evaluation results over the four years of the track. The method was also adapted to snippet retrieval and query expansion. The evaluation results indicate good performance of the approach.
Related resources
Three Statistical Summarizers at CLEF-INEX 2013 Tweet Contextualization Track
According to the organizers, the objective of the 2014 CLEF-INEX Tweet Contextualization Task is: “...The Tweet Contextualization aims at providing automatically information a summary that explains the tweet. This requires combining multiple types of processing from information retrieval to multi-document summarization including entity linking.” We present three statistical summarizer systems ap...
An Automatic Greedy Summarization System at INEX 2013 Tweet Contextualization Track
According to the organizers, the aim of the 2013 INEX Tweet Contextualization Track is: “...given a tweet, the system must provide some context about the subject of the tweet, in order to help the reader to understand it. This context should take the form of a readable (and short) summary, composed of passages from [...] Wikipedia.” We present an automatic greedy summarizer named REG applied to...
The 2012 INEX Snippet and Tweet Contextualization Tasks
This paper reports on our current experiments involving the Snippet and Tweet Contextualization Tracks of the 2012 INEX competition. Most of this work in snippet generation extends our earlier (2011) approach, described in [4], which produced a top-ranked result. The source of the snippet in these experiments is the top-ranked focused element(s) of the document in question. Another approach is ...
Two Statistical Summarizers at INEX 2012 Tweet Contextualization Track
According to the organizers, the objective of the 2012 INEX Tweet Contextualization Task is: “...given a tweet, the system must provide some context about the subject of the tweet, in order to help the reader to understand it. This context should take the form of a readable (and short) summary, composed of passages from [...] Wikipedia.” We present summarizers Cortex and KL-summ applied to the ...
Refining Methodologies for the INEX 2013 Snippet Generation and Tweet Contextualization Tracks
This paper describes our current experiments in snippet generation and tweet contextualization. These experiments are based on work reported in 2011 [2] and 2012 [1] and represent refinements of those earlier techniques. Four of our snippet generation runs produced top-ranked results in the INEX 2012 competition; these serve as the basis for our 2013 experiments in snippet generation. Our 2013 ...
Publication date: 2015